Abstract

A local equilibrium approach for the kinetics of a simplified protein folding model, whose equilibrium thermodynamics is exactly solvable, was developed in a previous paper. Important properties of this approach are (i) the free energy decreases with time, (ii) the exact equilibrium is recovered in the infinite time limit, (iii) the equilibration rate is an upper bound of the exact one and (iv) computational complexity is polynomial in the number of variables. Moreover, (v) this method is equivalent to another approximate approach to the kinetics: the path probability method. In this paper we give detailed rigorous proofs for the above results.

I. INTRODUCTION

The present paper contains details of the proofs of a few rigorous results for the local equilibrium approach to the kinetics of the Wako-Saito-Munoz-Eaton (WSME) model of protein folding, which was developed in a previous paper. This is a highly simplified model where one aims to describe the kinetics under the assumption that it is mainly determined by the structure of the native state, which is assumed to be known. It is a one-dimensional model, with long-range, many-body interactions, where a binary variable is associated with each peptide bond (the bond between consecutive amino acids). Two amino acids can interact only if they are in contact in the native state and all the peptide bonds between them are in the ordered state. Moreover, an entropic cost is associated with each ordered bond.

A homogeneous version of the model was first introduced in 1978 by Wako and Saito, who solved exactly the equilibrium thermodynamics and studied the appearance of phase transitions in the thermodynamic limit. The full heterogeneous case was later considered by Munoz, Eaton and coworkers, who introduced the single (double, triple) sequence approximations, i.e. they considered only configurations with at most one (two, three) stretches of consecutive ordered bonds, for both the equilibrium and the kinetics.

The equilibrium problem has been subsequently studied with exact solutions for homogeneous β-hairpin and α-helix structures, mean field approximation and Monte Carlo simulations. The exact solution for the equilibrium in the full heterogeneous case was given later. Moreover, it was shown that the equilibrium probability has an important factorization property, which implies the exactness of the cluster variation method (CVM) [12], a variational method for the study of lattice systems in statistical mechanics. Recently the model has found various applications in the study of the kinetics of real proteins [13] and, interestingly enough, in a problem of strained epitaxy [18, 19, 20].

In a previous paper we have studied the kinetics of the WSME model in a master equation framework by means of a local equilibrium approach [21, 22, 23], by assuming that the probability distribution factors at any time in the same way as the equilibrium one. A few rigorous results about the local equilibrium kinetics were reported there without a detailed proof, since the main purpose of that paper was to show the relevance of such an approach for the kinetics of protein folding. In the present paper we shall give detailed proofs of those results. Sec. II will be dedicated to a description of the model and our approach to the kinetics. In Sec. III we will concentrate on the properties of the local equilibrium approximation. In Sec. III A we shall show that the free energy never increases with time, a feature of the exact solution which is preserved by our technique. This result will allow us to prove, in Sec. III B, that the exact equilibrium probability distribution is recovered in the long time limit. The study of the asymptotic behaviour will then lead us to define an approximate relaxation rate, which is an upper bound of the exact one, as discussed in Sec. III C. In Sec. IV we will focus on a physically relevant example of kinetics, in order to show how the local equilibrium approximation reduces the complexity of the kinetic problem from exponential to polynomial in the number of variables. Finally, Sec. V will be devoted to the proof of the equivalence between our local equilibrium approach and the path probability method [24, 25, 26], the generalization of the CVM to the kinetics. It must be noticed that the only property of the WSME model which underlies the results of Secs. III and V is that the equilibrium problem can be solved exactly by means of the CVM, i.e. its equilibrium distribution factors in a suitable way. As a consequence, our conclusions are valid for any other model with the same property.



II. THE MODEL AND THE LOCAL EQUILIBRIUM KINETICS 

The model describes a protein of $N+1$ amino acids as a chain of $N$ peptide bonds (connecting consecutive amino acids) that can live in only two states (native and unfolded) and can interact only if they are in contact in the native structure and if all bonds in the chain between them are native. To each bond is associated a binary variable $m_i$, $i \in \{1, \ldots, N\}$, with values 0, 1 for unfolded and native state respectively. The effective free energy of the model (often improperly called Hamiltonian) reads

$$H_N(m) = \sum_{i=1}^{N-1} \sum_{j=i+1}^{N} \epsilon_{i,j} \Delta_{i,j} \prod_{k=i}^{j} m_k - RT \sum_{i=1}^{N} q_i (1 - m_i), \tag{1}$$

where $R$ is the gas constant and $T$ the absolute temperature. The first term assigns an energy $\epsilon_{i,j} < 0$ to the contact between bonds $i$ and $j$ if this takes place in the native structure ($\Delta_{i,j} = 1$ in this case and $\Delta_{i,j} = 0$ otherwise). The second term represents the entropic cost $q_i > 0$ of ordering bond $i$.
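
As an illustration of how the two terms of the effective free energy act on a configuration, the following minimal Python sketch evaluates it by direct summation. The function name `wsme_energy`, the 0-based indexing and the toy parameters `EPS`, `DELTA` and `Q` are assumptions made only for this example; they are not taken from the original work.

```python
def wsme_energy(m, eps, delta, q, RT=1.0):
    """Effective free energy of Eq. (1) for a bond configuration m (tuple of 0/1).

    eps[i][j], delta[i][j]: contact energies and native contact map (0-based,
    only entries with i < j are read); q[i]: entropic cost of ordering bond i.
    """
    n = len(m)
    contact = 0.0
    for i in range(n - 1):
        for j in range(i + 1, n):
            # A contact counts only if it is native and bonds i..j are all ordered.
            if delta[i][j] and all(m[i:j + 1]):
                contact += eps[i][j]
    # Entropic term: each disordered bond lowers the effective free energy.
    entropy = -RT * sum(q[i] * (1 - m[i]) for i in range(n))
    return contact + entropy


# Hypothetical 3-bond chain with one native contact between the first and last bond.
EPS = [[0.0, 0.0, -2.0], [0.0, 0.0, 0.0], [0.0, 0.0, 0.0]]
DELTA = [[0, 0, 1], [0, 0, 0], [0, 0, 0]]
Q = [1.0, 1.0, 1.0]
```

With these toy values the fully ordered chain gains the contact energy, while breaking the middle bond removes the contact even though both end bonds stay ordered.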

In order to solve exactly the equilibrium problem it has been found useful [8, 9] to map the one-dimensional WSME model onto a two-dimensional model through the introduction of the variables $x_{i,j} = \prod_{k=i}^{j} m_k$, which satisfy the short-range constraints $x_{i,j} = x_{i,j-1} x_{i+1,j}$ for $1 \le i < j \le N$. These can be associated to the nodes of a triangular shaped portion $\Lambda$ of a two-dimensional square lattice, defined by $\Lambda = \{(i,j) : 1 \le i \le j \le N\}$. Let $\mathcal{C}_\Lambda$ be the set of all configurations $x$ on $\Lambda$ that fulfil the previous constraints and rewrite Eq. (1) (divided by $RT$ and leaving apart an additive constant) in the form

$$H_\Lambda(x) = \sum_{(i,j) \in \Lambda} h_{i,j} x_{i,j}. \tag{2}$$
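
The mapping from bond variables to lattice variables can be checked by brute force. The sketch below builds the stretch variables $x_{i,j}$ from a bond configuration $m$ and verifies the short-range constraints $x_{i,j} = x_{i,j-1} x_{i+1,j}$; the helper names `stretch_variables` and `satisfies_constraints` are hypothetical and introduced only for this illustration.

```python
from itertools import product


def stretch_variables(m):
    """Return {(i, j): x_ij} with x_ij = m_i * ... * m_j (1-based, i <= j)."""
    n = len(m)
    return {(i, j): int(all(m[i - 1:j]))
            for i in range(1, n + 1) for j in range(i, n + 1)}


def satisfies_constraints(x, n):
    """Check x_ij = x_{i,j-1} * x_{i+1,j} for all 1 <= i < j <= n."""
    return all(x[(i, j)] == x[(i, j - 1)] * x[(i + 1, j)]
               for i in range(1, n) for j in range(i + 1, n + 1))


# Every one of the 2^4 bond configurations yields a valid lattice configuration.
ALL_VALID = all(satisfies_constraints(stretch_variables(m), 4)
                for m in product((0, 1), repeat=4))
```

The lattice has $N(N+1)/2$ nodes, but the constraints leave only the $2^N$ configurations generated by the diagonal variables $x_{i,i} = m_i$.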

From now on we shall concentrate on the above equation, where the $h_{i,j}$'s can be temperature dependent, without referring to the original protein folding (or strained epitaxy) problem. We will denote by $p^e_\Lambda$ the corresponding Boltzmann distribution and by $Z_\Lambda$ the partition function:

$$p^e_\Lambda(x) = \frac{\exp[-H_\Lambda(x)]}{Z_\Lambda} \tag{3}$$

and

$$Z_\Lambda = \sum_{x \in \mathcal{C}_\Lambda} \exp[-H_\Lambda(x)]. \tag{4}$$

This distribution has been shown to factor as

$$p^e_\Lambda(x) = \prod_{\alpha \in \mathcal{A}} [p^e_\alpha(x_\alpha)]^{a_\alpha}, \tag{5}$$

where $\mathcal{A}$ is a set of local clusters $\alpha \subset \Lambda$ made of all square plaquettes ($a_\alpha = 1$), the triangles lying on the diagonal boundary ($a_\alpha = 1$) and their intersections, that is internal nearest-neighbour pairs ($a_\alpha = -1$) and single nodes ($a_\alpha = 1$). It can be easily checked that the coefficients $a_\alpha$ are the Möbius numbers for the set $\mathcal{A}$. For each cluster $\alpha \in \mathcal{A}$ we denote by $x_\alpha$ ($x_{\Lambda \setminus \alpha}$) the projection of $x$ onto $\alpha$ ($\Lambda \setminus \alpha$), by $\mathcal{C}_\alpha$ the set of all configurations on $\alpha$ that are projections of configurations on $\Lambda$, and define the cluster equilibrium probability as the marginal distribution

$$p^e_\alpha(x_\alpha) = \sum_{x_{\Lambda \setminus \alpha}} p^e_\Lambda(x). \tag{6}$$

Notice that by definition $x_{(i,j)} = x_{i,j}$. It is important to point out that cluster probabilities allow one to reconstruct a probability on the whole lattice. Let $\mathcal{D}_\Lambda$ be the set of all cluster probabilities $p = \{p_\alpha\}_{\alpha \in \mathcal{A}}$ relative to $\Lambda$ satisfying the compatibility conditions $p_\beta(x_\beta) = \sum_{x_{\alpha \setminus \beta}} p_\alpha(x_\alpha)$ for $\alpha, \beta \in \mathcal{A}$ and $\beta \subset \alpha$. Then, for $p \in \mathcal{D}_\Lambda$, consider the function $P_\Lambda[p] : \mathcal{C}_\Lambda \to \mathbb{R}^+$ defined as

$$P_\Lambda[p](x) = \prod_{\alpha \in \mathcal{A}} [p_\alpha(x_\alpha)]^{a_\alpha}. \tag{7}$$

It can be checked that, due to the nature of the lattice and constraints, $P_\Lambda[p]$ is a probability on $\mathcal{C}_\Lambda$ and $p_\alpha$ is its marginal distribution relative to $\alpha$ [9].

As a consequence of Eq. (5), the equilibrium problem can be solved exactly [9] by means of the CVM. Since the Boltzmann distribution minimizes the free energy and factors, restricting the variational principle to distributions with the same property, i.e. of the form $P_\Lambda[p]$, one finds that $p^e \in \mathcal{D}_\Lambda$ is the minimum of the Kikuchi free energy

$$F_\Lambda[p] = \sum_{\alpha \in \mathcal{A}} a_\alpha \sum_{x_\alpha} [\ln p_\alpha(x_\alpha) + h_\alpha(x_\alpha)]\, p_\alpha(x_\alpha) \tag{8}$$

with respect to $p \in \mathcal{D}_\Lambda$. Here the $h_\alpha$ are defined by $h_\alpha(x_\alpha) = \sum_{(i,j) \in \alpha} h_{i,j} x_{i,j}$, and it follows that $H_\Lambda(x) = \sum_{\alpha \in \mathcal{A}} a_\alpha h_\alpha(x_\alpha)$. This variational approach has been used as the starting point for a very accurate treatment of the kinetics.

The kinetic problem has been stated in the framework of a master equation approach. Denoting by $W_\Lambda(x' \to x) \ge 0$ the transition probability per unit time from the state $x'$ to $x \ne x'$, we have to solve

$$\frac{d}{dt} p^t_\Lambda(x) = \sum_{x' \in \mathcal{C}_\Lambda} W_\Lambda(x' \to x)\, p^t_\Lambda(x'), \tag{9}$$

where from probability normalization it follows that $W_\Lambda(x \to x)$ has to be such that

$$\sum_{x \in \mathcal{C}_\Lambda} W_\Lambda(x' \to x) = 0. \tag{10}$$

It is known [27] that if the Boltzmann distribution is a stationary point for the equation, that is

$$\sum_{x' \in \mathcal{C}_\Lambda} W_\Lambda(x' \to x)\, p^e_\Lambda(x') = 0, \tag{11}$$

and $W_\Lambda$ is irreducible, then the solution of the master equation converges to $p^e_\Lambda$ in the long time limit. In the following we shall assume that these conditions hold.



In a previous paper we have studied the above problem by means of a local equilibrium approach [21, 22, 23], that is we have assumed that, provided the initial condition $p^0_\Lambda$ factors according to Eq. (5) as the equilibrium probability, the solution of the master equation factors in the same way at any subsequent time. We are therefore dealing with a kinetic problem in a restricted probability space. With this simplification the master equation yields for the cluster probabilities

$$\frac{d}{dt} p^t_\alpha(x_\alpha) = \sum_{x' \in \mathcal{C}_\Lambda} W_\alpha(x' \to x_\alpha)\, P_\Lambda[p^t](x'), \tag{12}$$

where

$$W_\alpha(x' \to x_\alpha) = \sum_{x_{\Lambda \setminus \alpha}} W_\Lambda(x' \to x). \tag{13}$$

By taking marginals of Eq. (12) one can verify that $p^t \in \mathcal{D}_\Lambda$ if $p^0 \in \mathcal{D}_\Lambda$, i.e. the above evolution preserves the compatibility conditions between the cluster probabilities.

In addition, it has been shown in [9] (where explicit expressions are given) that the equilibrium cluster probabilities can be written as linear functions of the expectation values

$$\xi^e_{i,j} = \sum_{x_\alpha} x_{i,j}\, p^e_\alpha(x_\alpha) \tag{14}$$

for $(i,j) \in \Lambda$ and any $\alpha$ containing the node $(i,j)$. By introducing in the same way the time-dependent variables $\xi_{i,j}(t)$ our kinetic problem will be turned into

$$\frac{d}{dt} \xi(t) = f(\xi(t)), \tag{15}$$

where, if $\xi = \{\xi_{i,j}\}_{(i,j) \in \Lambda}$ is the collection of the above expectation values corresponding to $p$, $f = \{f_{i,j}\}_{(i,j) \in \Lambda}$ is defined by

$$f_{i,j}(\xi) = \sum_{x \in \mathcal{C}_\Lambda} \sum_{x' \in \mathcal{C}_\Lambda} x_{i,j}\, W_\Lambda(x' \to x)\, P_\Lambda[p](x'). \tag{16}$$

$f$ will be computed explicitly in Sec. IV for a given, physically relevant, choice of $W_\Lambda$. Notice that, in the framework of the WSME protein folding model, $\xi_{i,j}(t)$ is just the probability of the stretch from $i$ to $j$ being native at time $t$.
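
To make the meaning of the expectation values concrete, the following sketch computes the equilibrium values by brute-force enumeration of the $2^N$ bond configurations, weighting each with $\exp[-H_\Lambda(x)]$ where $H_\Lambda(x) = \sum h_{i,j} x_{i,j}$. This is only feasible for small $N$ and is not the exact solution method of the cited works; the function name `equilibrium_expectations` and the dictionary layout of `h` are assumptions of this example.

```python
from itertools import product
from math import exp


def equilibrium_expectations(h, n):
    """Return (Z, xi) for H(x) = sum_{(i,j)} h[(i,j)] * x_ij by enumeration.

    h maps every node (i, j), 1 <= i <= j <= n, to its coupling h_ij.
    xi[(i, j)] is the equilibrium probability that bonds i..j are all native.
    """
    z = 0.0
    xi = {node: 0.0 for node in h}
    for m in product((0, 1), repeat=n):
        x = {(i, j): int(all(m[i - 1:j])) for (i, j) in h}
        weight = exp(-sum(h[node] * x[node] for node in h))
        z += weight
        for node in h:
            xi[node] += weight * x[node]
    return z, {node: value / z for node, value in xi.items()}


# With all couplings set to zero the two bonds are independent and unbiased.
Z_FREE, XI_FREE = equilibrium_expectations(
    {(1, 1): 0.0, (2, 2): 0.0, (1, 2): 0.0}, 2)
```

In the non-interacting case each bond is native with probability 1/2 and the stretch (1, 2) with probability 1/4, as expected for independent binary variables.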



III. PROPERTIES OF THE LOCAL EQUILIBRIUM KINETICS 
A. Time behaviour of the free energy 

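We start the study of the properties of the local equilibrium approach by showing that the free energy never increases. Within our approximation, which assumes that the probability factors at any time in the same way as the equilibrium one, the free energy reduces to the Kikuchi free energy (8). This is a fundamental property which holds for the exact solution and is preserved by our approximation.

As customary [27] we add to the Kikuchi free energy the constant $\ln Z_\Lambda$, where $Z_\Lambda$ is the partition function (4), and with a slight abuse of notation we indicate again by $F_\Lambda$ this new function. Notice that, using Eqs. (7), (5), (3) and the relation $H_\Lambda(x) = \sum_{\alpha \in \mathcal{A}} a_\alpha h_\alpha(x_\alpha)$, one can find that for $p \in \mathcal{D}_\Lambda$

$$
\begin{aligned}
\sum_{\alpha \in \mathcal{A}} a_\alpha \sum_{x_\alpha} h_\alpha(x_\alpha)\, p_\alpha(x_\alpha) + \ln Z_\Lambda
&= \sum_{\alpha \in \mathcal{A}} a_\alpha \sum_{x \in \mathcal{C}_\Lambda} h_\alpha(x_\alpha)\, P_\Lambda[p](x) + \ln Z_\Lambda \\
&= \sum_{x \in \mathcal{C}_\Lambda} [H_\Lambda(x) + \ln Z_\Lambda]\, P_\Lambda[p](x) \\
&= -\sum_{\alpha \in \mathcal{A}} a_\alpha \sum_{x \in \mathcal{C}_\Lambda} \ln p^e_\alpha(x_\alpha)\, P_\Lambda[p](x) \\
&= -\sum_{\alpha \in \mathcal{A}} a_\alpha \sum_{x_\alpha} \ln p^e_\alpha(x_\alpha)\, p_\alpha(x_\alpha)
\end{aligned} \tag{17}
$$

and obtain thus

$$F_\Lambda[p] = \sum_{\alpha \in \mathcal{A}} a_\alpha \sum_{x_\alpha} p_\alpha(x_\alpha) \ln \frac{p_\alpha(x_\alpha)}{p^e_\alpha(x_\alpha)}. \tag{18}$$

The function $F_\Lambda$, that we shall continue to call Kikuchi free energy, is not negative since $F_\Lambda[p] \ge F_\Lambda[p^e] = 0$, and in the following will play the role of a Lyapunov function [28]. We now study its time behaviour, mimicking the exact approach [27]. Our results can easily be rewritten for a discrete time formulation.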

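Proposition: If $p^t$ is the solution of Eq. (12) with initial condition $p^0$ then a function $L_\Lambda : \mathcal{D}_\Lambda \to \mathbb{R}$ exists such that

i) $L_\Lambda[p] \le 0 \;\; \forall p \in \mathcal{D}_\Lambda$;

ii) $L_\Lambda[p] = 0 \iff p = p^e$;

iii) $\frac{d}{dt} F_\Lambda[p^t] = L_\Lambda[p^t]$.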

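Proof. For $p \in \mathcal{D}_\Lambda$ let

$$\psi_\Lambda[p](x) = P_\Lambda[p](x) / p^e_\Lambda(x). \tag{19}$$

The time derivative of the free energy can be written as

$$
\frac{d}{dt} F_\Lambda[p^t]
= \sum_{\alpha \in \mathcal{A}} a_\alpha \sum_{x_\alpha} \left[ \ln \frac{p^t_\alpha(x_\alpha)}{p^e_\alpha(x_\alpha)} + 1 \right] \frac{d}{dt} p^t_\alpha(x_\alpha)
= \sum_{\alpha \in \mathcal{A}} a_\alpha \sum_{x_\alpha} \ln \frac{p^t_\alpha(x_\alpha)}{p^e_\alpha(x_\alpha)}\, \frac{d}{dt} p^t_\alpha(x_\alpha), \tag{20}
$$

where the last step follows from normalization of cluster probabilities. Using Eqs. (12), (13) and (19) respectively we have

$$
\begin{aligned}
\frac{d}{dt} F_\Lambda[p^t]
&= \sum_{\alpha \in \mathcal{A}} a_\alpha \sum_{x_\alpha} \ln \frac{p^t_\alpha(x_\alpha)}{p^e_\alpha(x_\alpha)} \sum_{x' \in \mathcal{C}_\Lambda} W_\alpha(x' \to x_\alpha)\, p^e_\Lambda(x')\, \psi_\Lambda[p^t](x') \\
&= \sum_{x \in \mathcal{C}_\Lambda} \left[ \sum_{\alpha \in \mathcal{A}} a_\alpha \ln \frac{p^t_\alpha(x_\alpha)}{p^e_\alpha(x_\alpha)} \right] \sum_{x' \in \mathcal{C}_\Lambda} W_\Lambda(x' \to x)\, p^e_\Lambda(x')\, \psi_\Lambda[p^t](x') \\
&= \sum_{x \in \mathcal{C}_\Lambda} \sum_{x' \in \mathcal{C}_\Lambda} W_\Lambda(x' \to x)\, p^e_\Lambda(x')\, \psi_\Lambda[p^t](x') \ln \psi_\Lambda[p^t](x),
\end{aligned} \tag{21}
$$

and statement iii) of our proposition is obtained by setting

$$L_\Lambda[p] = \sum_{x \in \mathcal{C}_\Lambda} \sum_{x' \in \mathcal{C}_\Lambda} W_\Lambda(x' \to x)\, p^e_\Lambda(x')\, \psi_\Lambda[p](x') \ln \psi_\Lambda[p](x). \tag{22}$$

Statements i) and ii) follow from the equivalence (to be proven later) of the above definition and

$$L_\Lambda[p] = \sum_{x \in \mathcal{C}_\Lambda} \sum_{x' \in \mathcal{C}_\Lambda \setminus \{x\}} W_\Lambda(x' \to x)\, p^e_\Lambda(x') \left[ \psi_\Lambda[p](x') - \psi_\Lambda[p](x) + \psi_\Lambda[p](x') \ln \frac{\psi_\Lambda[p](x)}{\psi_\Lambda[p](x')} \right]. \tag{23}$$

The inequality $a - b + a \ln \frac{b}{a} \le 0$ ($a, b \in \mathbb{R}^+$) implies i). In order to obtain ii) observe that the previous inequality reduces to an equality if and only if $a = b$ and then, by the irreducibility of $W_\Lambda$, $L_\Lambda[p]$ vanishes if and only if $\psi_\Lambda[p](x)$ is independent of $x$. $P_\Lambda[p] = p^e_\Lambda$ follows by normalization and $p = p^e$ by computing the marginal distributions of each member.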

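We complete our proof by showing the equivalence between Eqs. (22) and (23). From now on let $\psi$ be a positive function defined on $\mathcal{C}_\Lambda$. Notice first that, thanks to Eqs. (10) and (11), we have

$$
\begin{aligned}
0 &= \sum_{x' \in \mathcal{C}_\Lambda} \left[ \sum_{x \in \mathcal{C}_\Lambda} W_\Lambda(x' \to x) \right] p^e_\Lambda(x')\, \psi(x') - \sum_{x \in \mathcal{C}_\Lambda} \left[ \sum_{x' \in \mathcal{C}_\Lambda} W_\Lambda(x' \to x)\, p^e_\Lambda(x') \right] \psi(x) \\
&= \sum_{x \in \mathcal{C}_\Lambda} \sum_{x' \in \mathcal{C}_\Lambda \setminus \{x\}} W_\Lambda(x' \to x)\, p^e_\Lambda(x')\, \psi(x') + \sum_{x \in \mathcal{C}_\Lambda} W_\Lambda(x \to x)\, p^e_\Lambda(x)\, \psi(x) \\
&\quad - \sum_{x \in \mathcal{C}_\Lambda} \sum_{x' \in \mathcal{C}_\Lambda \setminus \{x\}} W_\Lambda(x' \to x)\, p^e_\Lambda(x')\, \psi(x) - \sum_{x \in \mathcal{C}_\Lambda} W_\Lambda(x \to x)\, p^e_\Lambda(x)\, \psi(x) \\
&= \sum_{x \in \mathcal{C}_\Lambda} \sum_{x' \in \mathcal{C}_\Lambda \setminus \{x\}} W_\Lambda(x' \to x)\, p^e_\Lambda(x')\, [\psi(x') - \psi(x)].
\end{aligned} \tag{24}
$$

In addition, using again Eq. (10), we can see that

$$
\begin{aligned}
\sum_{x \in \mathcal{C}_\Lambda} \sum_{x' \in \mathcal{C}_\Lambda} & W_\Lambda(x' \to x)\, p^e_\Lambda(x')\, \psi(x') \ln \psi(x) \\
&= \sum_{x \in \mathcal{C}_\Lambda} \sum_{x' \in \mathcal{C}_\Lambda} W_\Lambda(x' \to x)\, p^e_\Lambda(x')\, \psi(x') \ln \psi(x) - \sum_{x' \in \mathcal{C}_\Lambda} \left[ \sum_{x \in \mathcal{C}_\Lambda} W_\Lambda(x' \to x) \right] p^e_\Lambda(x')\, \psi(x') \ln \psi(x') \\
&= \sum_{x \in \mathcal{C}_\Lambda} \sum_{x' \in \mathcal{C}_\Lambda \setminus \{x\}} W_\Lambda(x' \to x)\, p^e_\Lambda(x')\, \psi(x') \ln \psi(x) + \sum_{x \in \mathcal{C}_\Lambda} W_\Lambda(x \to x)\, p^e_\Lambda(x)\, \psi(x) \ln \psi(x) \\
&\quad - \sum_{x \in \mathcal{C}_\Lambda} \sum_{x' \in \mathcal{C}_\Lambda \setminus \{x\}} W_\Lambda(x' \to x)\, p^e_\Lambda(x')\, \psi(x') \ln \psi(x') - \sum_{x \in \mathcal{C}_\Lambda} W_\Lambda(x \to x)\, p^e_\Lambda(x)\, \psi(x) \ln \psi(x) \\
&= \sum_{x \in \mathcal{C}_\Lambda} \sum_{x' \in \mathcal{C}_\Lambda \setminus \{x\}} W_\Lambda(x' \to x)\, p^e_\Lambda(x')\, \psi(x') \ln \frac{\psi(x)}{\psi(x')}.
\end{aligned} \tag{25}
$$

Adding the vanishing quantity (24) to the last member of Eq. (25), with $\psi = \psi_\Lambda[p]$, turns Eq. (22) into Eq. (23).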
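
The equivalence between the two expressions of the Lyapunov function only uses the normalization of the transition rates and the stationarity of the Boltzmann distribution, so it can be tested numerically on any small system. The following sketch does so for a hypothetical three-state system with Metropolis rates; the state energies, the trial distribution and all function names are assumptions chosen for this example and do not come from the WSME model.

```python
from math import exp, log


def metropolis_rate_matrix(energies):
    """W[x][xp] is the rate for xp -> x; each column sums to zero."""
    n = len(energies)
    w = [[0.0] * n for _ in range(n)]
    for xp in range(n):
        for x in range(n):
            if x != xp:
                w[x][xp] = min(1.0, exp(energies[xp] - energies[x]))
        w[xp][xp] = -sum(w[x][xp] for x in range(n) if x != xp)
    return w


def boltzmann(energies):
    weights = [exp(-e) for e in energies]
    z = sum(weights)
    return [wt / z for wt in weights]


def lyapunov_double_sum(w, pe, psi):
    """Sum over all pairs (x, x') of W(x'->x) pe(x') psi(x') ln psi(x)."""
    n = len(pe)
    return sum(w[x][xp] * pe[xp] * psi[xp] * log(psi[x])
               for x in range(n) for xp in range(n))


def lyapunov_off_diagonal(w, pe, psi):
    """Sum over x' != x of W pe(x') [a - b + a ln(b/a)], a = psi(x'), b = psi(x)."""
    n = len(pe)
    total = 0.0
    for x in range(n):
        for xp in range(n):
            if xp != x:
                a, b = psi[xp], psi[x]
                total += w[x][xp] * pe[xp] * (a - b + a * log(b / a))
    return total


ENERGIES = [0.0, 1.0, -0.5]
W = metropolis_rate_matrix(ENERGIES)
PE = boltzmann(ENERGIES)
TRIAL = [0.5, 0.3, 0.2]                      # an arbitrary non-equilibrium distribution
PSI = [p / q for p, q in zip(TRIAL, PE)]     # ratio to the equilibrium distribution
```

For this trial distribution the two functions agree up to rounding and are negative, while both vanish when the ratio is constant, in line with statements i) and ii).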



B. Exactness of the equilibrium state 



In spite of an approximate picture for nonequilibrium states, the local equilibrium approach allows one to recover the exact probability distribution in the long time limit. This is due to the role of Lyapunov function [28] played by the Kikuchi free energy $F_\Lambda$, as stated in the following proposition.

Proposition: If $p^t$ is the solution of Eq. (12) with initial condition $p^0$ then

$$\lim_{t \to +\infty} p^t = p^e. \tag{26}$$

Proof. $p^e$ is a stationary point of Eq. (12), as a consequence of Eqs. (5) and (11). We already know that a non-positive function $L_\Lambda$ exists, such that $L_\Lambda[p] = 0$ if and only if $p = p^e$ and $L_\Lambda[p^t]$ is the derivative of the free energy $F_\Lambda$ with respect to time. Moreover $p^e$ is the minimum of $F_\Lambda$. Then a theorem by Lyapunov ensures that $p^e$ is asymptotically stable.



C. Asymptotic behaviour 

In the present section we shall discuss how the local equilibrium kinetics approaches the 
exact equilibrium state. This will lead us to define a relaxation rate for our approximation, 
and we shall show that this rate is an upper bound of the exact one. 

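The study of the asymptotic behaviour of the approximate evolution involves the linearized form of Eq. (12), for which, as we shall prove, some properties of the exact kinetic equations are still valid. To begin with we recall the algebraic structure underlying the near-equilibrium kinetics in the exact case.

Let us consider, on the vector space $\mathcal{V}_\Lambda$ of all real functions on $\mathcal{C}_\Lambda$, the scalar product

$$\langle \phi | \psi \rangle = \sum_{x \in \mathcal{C}_\Lambda} \frac{\phi(x)\, \psi(x)}{p^e_\Lambda(x)}. \tag{27}$$

It is known [27] that if the transition probability satisfies the detailed balance principle, that is

$$W_\Lambda(x' \to x)\, p^e_\Lambda(x') = W_\Lambda(x \to x')\, p^e_\Lambda(x), \tag{28}$$

then the linear operator $W_\Lambda : \mathcal{V}_\Lambda \to \mathcal{V}_\Lambda$ defined by

$$(W_\Lambda \phi)(x) = \sum_{x' \in \mathcal{C}_\Lambda} W_\Lambda(x' \to x)\, \phi(x') \tag{29}$$

is self-adjoint and negative semi-definite with respect to the above scalar product. Moreover, if $W_\Lambda$ is irreducible then $\langle \phi | W_\Lambda \phi \rangle < 0$ provided that $\phi$ is not proportional to $p^e_\Lambda$, and the exact relaxation rate $k^{ex}$ is given by

$$k^{ex} = -\max \left\{ \frac{\langle \phi | W_\Lambda \phi \rangle}{\langle \phi | \phi \rangle} : \phi \in \mathcal{V}_\Lambda \setminus \{0\} \text{ and } \langle \phi | p^e_\Lambda \rangle = 0 \right\}. \tag{30}$$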

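Let us move now to our approach and attempt to extend these results. From now on we shall assume detailed balance (28). Let us introduce the vector space $\mathcal{V}_{\mathcal{A}}$ of collections $u = \{u_\alpha\}_{\alpha \in \mathcal{A}}$ of real functions on $\mathcal{C}_\alpha$, for all $\alpha \in \mathcal{A}$, satisfying the conditions $\sum_{x_{\alpha \setminus \beta}} u_\alpha(x_\alpha) = u_\beta(x_\beta)$ if $\beta \subset \alpha$ and $\sum_{x_\alpha} u_\alpha(x_\alpha) = 0$. Notice that, since $p^e$ is strictly positive and thanks to the previous conditions, for $u \in \mathcal{V}_{\mathcal{A}}$ it is possible to find $\epsilon_0 > 0$ such that $p^e + \epsilon u \in \mathcal{D}_\Lambda$ if $\epsilon \in (-\epsilon_0, \epsilon_0)$. On the space $\mathcal{V}_{\mathcal{A}}$ we can define a scalar product through the bilinear form

$$\langle u | v \rangle = \sum_{\alpha \in \mathcal{A}} a_\alpha \sum_{x_\alpha} \frac{u_\alpha(x_\alpha)\, v_\alpha(x_\alpha)}{p^e_\alpha(x_\alpha)}. \tag{31}$$

It can be checked that $\langle u | u \rangle > 0$ for $u \in \mathcal{V}_{\mathcal{A}} \setminus \{0\}$. Indeed, for $u \ne 0$,

$$\langle u | u \rangle = \left. \frac{d^2}{d\epsilon^2} F_\Lambda[p^e + \epsilon u] \right|_{\epsilon = 0}, \tag{32}$$

as can be easily verified, and the result follows from the fact that $p^e$ is the minimum of the free energy $F_\Lambda$. Let $\| \cdot \|$ be the norm generated by this scalar product, i.e. $\|u\| = \sqrt{\langle u | u \rangle}$. Observe that the map $U_\Lambda : \mathcal{V}_{\mathcal{A}} \to \mathcal{V}_\Lambda$, defined by

$$U_\Lambda[u](x) = \left. \frac{d}{d\epsilon} P_\Lambda[p^e + \epsilon u](x) \right|_{\epsilon = 0} = p^e_\Lambda(x) \sum_{\alpha \in \mathcal{A}} a_\alpha \frac{u_\alpha(x_\alpha)}{p^e_\alpha(x_\alpha)}, \tag{33}$$

preserves the scalar products, since

$$
\langle U_\Lambda[u] | U_\Lambda[v] \rangle
= \sum_{x \in \mathcal{C}_\Lambda} \sum_{\alpha \in \mathcal{A}} a_\alpha \frac{u_\alpha(x_\alpha)}{p^e_\alpha(x_\alpha)} \left. \frac{d}{d\epsilon} P_\Lambda[p^e + \epsilon v](x) \right|_{\epsilon = 0}
= \sum_{\alpha \in \mathcal{A}} a_\alpha \sum_{x_\alpha} \frac{u_\alpha(x_\alpha)}{p^e_\alpha(x_\alpha)} \left. \frac{d}{d\epsilon} [p^e_\alpha(x_\alpha) + \epsilon v_\alpha(x_\alpha)] \right|_{\epsilon = 0}
= \langle u | v \rangle. \tag{34}
$$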

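The local equilibrium relaxation rate appears naturally through a linearization of the near-equilibrium kinetics. To deal with an evolution in the space $\mathcal{V}_{\mathcal{A}}$, we set $p^t = p^e + u^t$, such that $u^t \in \mathcal{V}_{\mathcal{A}}$ for all $t \in \mathbb{R}^+$. Then we replace Eq. (12) with the equivalent one

$$\frac{d}{dt} u^t = T_\Lambda u^t + R_\Lambda(u^t), \tag{35}$$

where $T_\Lambda : \mathcal{V}_{\mathcal{A}} \to \mathcal{V}_{\mathcal{A}}$ is the linear operator defined by

$$
(T_\Lambda u)_\alpha(x_\alpha) = \left. \frac{d}{d\epsilon} \sum_{x' \in \mathcal{C}_\Lambda} W_\alpha(x' \to x_\alpha)\, P_\Lambda[p^e + \epsilon u](x') \right|_{\epsilon = 0}
= \sum_{x' \in \mathcal{C}_\Lambda} W_\alpha(x' \to x_\alpha)\, U_\Lambda[u](x'). \tag{36}
$$

$T_\Lambda$ is the operator corresponding to the Jacobian of the r.h.s. of Eq. (12) evaluated at $p = p^e$ and, as a consequence, there exist a neighbourhood $\mathcal{I}_0 \subset \mathcal{V}_{\mathcal{A}}$ of 0 and a constant $M > 0$ such that

$$\|R_\Lambda(u)\| \le M \|u\|^2 \tag{37}$$

for all $u \in \mathcal{I}_0$.

$T_\Lambda$ inherits its properties from $W_\Lambda$ thanks to the relation

$$\langle u | T_\Lambda v \rangle = \langle U_\Lambda[u] | W_\Lambda U_\Lambda[v] \rangle, \tag{38}$$

which can be immediately verified. It follows that $T_\Lambda$ is self-adjoint and negative definite. The first property is obvious, while for the second one it is sufficient to observe that

$$\langle u | T_\Lambda u \rangle = \langle U_\Lambda[u] | W_\Lambda U_\Lambda[u] \rangle \le -k^{ex} \langle U_\Lambda[u] | U_\Lambda[u] \rangle = -k^{ex} \|u\|^2, \tag{39}$$

since

$$\langle U_\Lambda[u] | p^e_\Lambda \rangle = \sum_{x \in \mathcal{C}_\Lambda} U_\Lambda[u](x) = 0. \tag{40}$$

Finally, defining $k$ as

$$k = -\max \left\{ \frac{\langle u | T_\Lambda u \rangle}{\langle u | u \rangle} : u \in \mathcal{V}_{\mathcal{A}} \setminus \{0\} \right\}, \tag{41}$$

we have by Eq. (39) $k \ge k^{ex}$.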

Now we can show that $k$, the opposite of the maximum eigenvalue of the operator $T_\Lambda$, is the approximate relaxation rate. We have just seen that it is not smaller than the exact one, a result that can be intuitively understood by observing that the local equilibrium assumption implies that we are dealing with an evolution in a restricted probability space.

Proposition: Let $u^t = p^t - p^e$ be the solution of Eq. (35) with initial condition $u^0 = p^0 - p^e$. $u^t$ vanishes exponentially with time or, more precisely, with $k$ defined by Eq. (41), there exist $v$ and $E^t \in \mathcal{V}_{\mathcal{A}}$ such that

$$u^t = e^{-kt} [v + E^t] \tag{42}$$

and

$$\lim_{t \to +\infty} E^t = 0. \tag{43}$$

Proof. We shall first find an exponential bound for $\|u^t\|$ by showing that there exist $t_0 > 0$ and $A > 0$ such that, for $t \ge t_0$, $u^t \in \mathcal{I}_0$ and

$$\|u^t\| \le A e^{-kt}. \tag{44}$$

Since $\lim_{t \to +\infty} u^t = 0$, given a positive constant $b$ one can find $t_0 > 0$ such that $u^t \in \mathcal{I}_0$ and $\|u^t\| \le b$ for $t \ge t_0$. Then, taking the scalar product of Eq. (35) with $u^t$, for $t \ge t_0$ we have

$$
\begin{aligned}
\frac{1}{2} \frac{d}{dt} \|u^t\|^2 &= \langle u^t | T_\Lambda u^t \rangle + \langle u^t | R_\Lambda(u^t) \rangle \\
&\le -k \|u^t\|^2 + M \|u^t\|^3 \\
&\le -(k - Mb) \|u^t\|^2,
\end{aligned} \tag{45}
$$

which can be integrated to give

$$\|u^t\| \le e^{-(k - Mb)(t - t_0)} \|u^{t_0}\|. \tag{46}$$

Inserting the above result in the first inequality of Eq. (45) we obtain

$$\frac{1}{2} \frac{d}{dt} \|u^t\|^2 \le -k \|u^t\|^2 + M \|u^{t_0}\|^3 e^{-3(k - Mb)(t - t_0)}, \tag{47}$$

which can be integrated again to give, provided that $k - 3Mb > 0$,

$$\|u^t\|^2 \le e^{-2k(t - t_0)} \left[ \|u^{t_0}\|^2 + \frac{2M \|u^{t_0}\|^3}{k - 3Mb} \right]. \tag{48}$$

The constant $b$ can be chosen small enough to ensure that $k - 3Mb > 0$, and hence $A$ in Eq. (44) can be given by

$$A = e^{k t_0} \left[ \|u^{t_0}\|^2 + \frac{2M \|u^{t_0}\|^3}{k - 3Mb} \right]^{1/2}. \tag{49}$$

Finally, since $u^t$ can be expanded in eigenvectors of $T_\Lambda$, our proposition is proven if we show that $\lim_{t \to +\infty} \langle w | u^t \rangle e^{kt}$ exists and is finite for any eigenvector $w$ with $\|w\| = 1$. If this is the case, $v$ and $E^t$ are given by $\lim_{t \to +\infty} u^t e^{kt}$ and $u^t e^{kt} - v$ respectively. It can be noted that $v$ has to be an eigenvector of $T_\Lambda$ relative to $-k$.

Eq. (44) tells us that $\langle w | u^t \rangle e^{kt}$ is bounded. If $w$ corresponds to $-k$, let us consider an increasing sequence $\{t_n\}_{n \in \mathbb{N}}$ diverging to infinity such that $t_0$ is the value previously introduced. From Eq. (35) we have

$$\frac{d}{dt} \langle w | u^t \rangle = -k \langle w | u^t \rangle + \langle w | R_\Lambda(u^t) \rangle. \tag{50}$$

Using Eqs. (37) and (44) and integrating between $t_n$ and $t_{n+m}$ we obtain

$$\left| \langle w | u^{t_{n+m}} \rangle e^{k t_{n+m}} - \langle w | u^{t_n} \rangle e^{k t_n} \right| \le \frac{M A^2}{k} e^{-k t_n} \tag{51}$$

for all $m \in \mathbb{N}$, which shows that the sequence $\{\langle w | u^{t_n} \rangle e^{k t_n}\}_{n \in \mathbb{N}}$ is a Cauchy one. Then the limit $\lim_{n \to \infty} \langle w | u^{t_n} \rangle e^{k t_n}$ exists and is finite and thus, since the sequence $\{t_n\}_{n \in \mathbb{N}}$ is arbitrary, so does $\lim_{t \to +\infty} \langle w | u^t \rangle e^{kt}$.

If $w$ corresponds to $-\lambda < -k$ then

$$\frac{d}{dt} \langle w | u^t \rangle = -\lambda \langle w | u^t \rangle + \langle w | R_\Lambda(u^t) \rangle. \tag{52}$$

Using again Eqs. (37) and (44) and integrating, with manipulations similar to the previous ones, one can find two constants $B > 0$ and $C > 0$ such that, for $t \ge t_0$,

$$|\langle w | u^t \rangle| \le B e^{-2kt} + C e^{-\lambda t}, \tag{53}$$

from which it follows that $\lim_{t \to +\infty} \langle w | u^t \rangle e^{kt} = 0$.



IV. LOCAL EQUILIBRIUM KINETICS FOR THE WSME MODEL 



After studying some features of the approximation, we now show its performance with an example, i.e. a particular, physically relevant, choice of the transition matrix. The goal is the reduction of the computational complexity of the kinetic problem from exponential to polynomial in the number of variables. We focus on single bond-flip kinetics, that is we consider kinetics with transitions only between configurations that differ in the state of at most one peptide bond, and for which the detailed balance principle (28) is satisfied.

Given $x \in \mathcal{C}_\Lambda$ we denote by $\mu_k(x)$ the configuration obtained from $x$ by turning the variable $x_{k,k}$, i.e. $m_k$, into $1 - x_{k,k}$. We have explicitly

$$
\mu_k(x)_{i,j} =
\begin{cases}
x_{i,j} & \text{if } j < k \text{ or } i > k; \\
x_{i,k-1} (1 - x_{k,k})\, x_{k+1,j} & \text{otherwise},
\end{cases} \tag{54}
$$

with the convention that $x_{i,k-1} = 1$ if $i = k$ and $x_{k+1,j} = 1$ if $j = k$. A single bond-flip kinetics will be described by a transition matrix with the property $W_\Lambda(x \to x') = 0$ if $x' \notin \{x, \mu_1(x), \ldots, \mu_N(x)\}$. For the matrix elements corresponding to the allowed transitions we set

$$W_\Lambda(x \to \mu_k(x)) = \nu\big(H_\Lambda(\mu_k(x)) - H_\Lambda(x)\big), \tag{55}$$

where $\nu : \mathbb{R} \to \mathbb{R}$ is a strictly positive function which satisfies detailed balance:

$$\nu(-\Delta) = e^{\Delta} \nu(\Delta). \tag{56}$$

Condition (10) requires that

$$W_\Lambda(x \to x) = -\sum_{k=1}^{N} \nu\big(H_\Lambda(\mu_k(x)) - H_\Lambda(x)\big) \tag{57}$$

and it is easy to check that $W_\Lambda$ is irreducible. The family of $\nu$ functions fulfilling Eq. (56) contains the Metropolis and Glauber kinetic prescriptions, that are

$$\nu(\Delta) = \min\{1, e^{-\Delta}\} \tag{58}$$

and

$$\nu(\Delta) = 1 / (1 + e^{\Delta}) \tag{59}$$

respectively.
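
The single bond-flip rates can be sketched in a few lines and checked against detailed balance. The code below is a minimal illustration that enumerates all bond configurations of a short chain, so it has exponential cost and is meant only as a consistency check; the function names and the toy couplings `H_TOY` are assumptions of this example.

```python
from itertools import product
from math import exp, isclose


def metropolis(delta):
    return min(1.0, exp(-delta))


def glauber(delta):
    return 1.0 / (1.0 + exp(delta))


def energy(m, h):
    """H(x) = sum of h_ij over the stretches i..j that are fully native in m."""
    return sum(hij for (i, j), hij in h.items() if all(m[i - 1:j]))


def flip(m, k):
    """Configuration obtained from m by flipping bond k (1-based)."""
    return m[:k - 1] + (1 - m[k - 1],) + m[k:]


def single_flip_rates(h, n, nu):
    """Return {(m, m2): rate m -> m2}, including the diagonal entries (m, m)."""
    rates = {}
    for m in product((0, 1), repeat=n):
        escape = 0.0
        for k in range(1, n + 1):
            m2 = flip(m, k)
            rate = nu(energy(m2, h) - energy(m, h))
            rates[(m, m2)] = rate
            escape += rate
        rates[(m, m)] = -escape   # normalization of the rates out of m
    return rates


def obeys_detailed_balance(h, n, nu):
    rates = single_flip_rates(h, n, nu)
    for m in product((0, 1), repeat=n):
        for k in range(1, n + 1):
            m2 = flip(m, k)
            forward = rates[(m, m2)] * exp(-energy(m, h))
            backward = rates[(m2, m)] * exp(-energy(m2, h))
            if not isclose(forward, backward, rel_tol=1e-12):
                return False
    return True


# Hypothetical couplings for a 3-bond chain.
H_TOY = {(1, 1): 0.5, (2, 2): 0.5, (3, 3): 0.5,
         (1, 2): -1.0, (2, 3): -1.0, (1, 3): -2.0}
```

Both prescriptions pass the check because each satisfies the relation between the rates for opposite energy changes, which is all that detailed balance requires of a single-flip kinetics.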
To show the form of the approximate kinetic equations we must explicitly write the r.h.s. of Eq. (15), i.e. the function $f$ defined in (16). Writing probabilities as functions of the expectation values

$$\xi_{i,j} = \sum_{x \in \mathcal{C}_\Lambda} x_{i,j}\, P_\Lambda[p](x), \tag{60}$$

as anticipated in Sec. II, $f$ takes the form

$$f_{i,j}(\xi) = \sum_{k=1}^{N} \sum_{x \in \mathcal{C}_\Lambda} [\mu_k(x)_{i,j} - x_{i,j}]\, \nu\big(H_\Lambda(\mu_k(x)) - H_\Lambda(x)\big)\, P_\Lambda[p](x), \tag{61}$$

where Eqs. (55), (57) and the substitution of $x$ with $\mu_k(x)$ in the first term of the r.h.s. have been used. Since $\mu_k(x)_{i,j} = x_{i,j}$ if $k < i$ or $k > j$ we can also write

$$f_{i,j}(\xi) = \sum_{k=i}^{j} \sum_{x \in \mathcal{C}_\Lambda} [\mu_k(x)_{i,j} - x_{i,j}]\, \nu\big(H_\Lambda(\mu_k(x)) - H_\Lambda(x)\big)\, P_\Lambda[p](x). \tag{62}$$

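In the following we will need the new variables $S_{i,j}$, which take value 1 if $i$ and $j$ are disordered peptide bonds which delimit a string of consecutive ordered bonds. These variables are defined as

$$
S_{i,j}(x) =
\begin{cases}
1 - x_{i,i} & \text{if } i = j; \\
(1 - x_{i,i})(1 - x_{j,j}) & \text{if } j = i + 1; \\
(1 - x_{i,i})\, x_{i+1,j-1}\, (1 - x_{j,j}) & \text{if } 0 \le i < j - 1 \le N,
\end{cases} \tag{63}
$$

where we assume the boundary conditions $m_0 = m_{N+1} = 0$, which imply $x_{i,j} = 0$ if $i = 0$ or $j = N + 1$. It is not difficult to check that

$$x_{i,j} = \sum_{s=j}^{N} \sum_{r=1}^{i} S_{r-1,s+1}(x) \tag{64}$$

and

$$\sum_{j=k}^{N} \sum_{i=1}^{k} h_{i,j}\, x_{i,j} = \sum_{j=k}^{N} \sum_{i=1}^{k} \Delta^{k}_{i,j}\, S_{i-1,j+1}(x), \tag{65}$$

where, for $1 \le i \le k \le j \le N$, we have introduced the quantities

$$\Delta^{k}_{i,j} = \sum_{s=k}^{j} \sum_{r=i}^{k} h_{r,s}. \tag{66}$$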
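Now we can return to Eq. (62). Notice that, for $i \le k \le j$,

$$
\begin{aligned}
[\mu_k(x)_{i,j} - x_{i,j}]\, \nu\big(H_\Lambda(\mu_k(x)) - H_\Lambda(x)\big)
&= [\mu_k(x)_{i,j} - x_{i,j}]\, \nu\Big( \sum_{s=k}^{N} \sum_{r=1}^{k} h_{r,s} [\mu_k(x)_{r,s} - x_{r,s}] \Big) \\
&= \mu_k(x)_{i,j}\, \nu\Big( \sum_{s=k}^{N} \sum_{r=1}^{k} h_{r,s}\, \mu_k(x)_{r,s} \Big) - x_{i,j}\, \nu\Big( -\sum_{s=k}^{N} \sum_{r=1}^{k} h_{r,s}\, x_{r,s} \Big).
\end{aligned} \tag{67}
$$

The first equality follows from the relation $\mu_k(x)_{r,s} = x_{r,s}$ for $r > k$ or $s < k$. The second one from the fact that if $i \le k \le j$ and $x_{i,j} = 1$ then $\mu_k(x)_{r,s} = 0$ for $r \le k$ and $s \ge k$, and vice-versa.

We will concentrate only on the term

$$x_{i,j}\, \nu\Big( -\sum_{s=k}^{N} \sum_{r=1}^{k} h_{r,s}\, x_{r,s} \Big) \tag{68}$$

since we can obtain the first one by replacing $h_{i,j}$ with $-h_{i,j}$ and $x$ with $\mu_k(x)$. Eq. (65) allows us to write

$$x_{i,j}\, \nu\Big( -\sum_{s=k}^{N} \sum_{r=1}^{k} h_{r,s}\, x_{r,s} \Big) = x_{i,j}\, \nu\Big( -\sum_{s=k}^{N} \sum_{r=1}^{k} \Delta^{k}_{r,s}\, S_{r-1,s+1}(x) \Big). \tag{69}$$

The last equality is nontrivial only for configurations $x \in \mathcal{C}_\Lambda$ such that $x_{i,j} = 1$, and hence $x_{k,k} = 1$. We now observe that, if $x_{k,k} = 1$, varying $r$ in $\{1, \ldots, k\}$ and $s$ in $\{k, \ldots, N\}$ one can find one and only one pair of indices $(r, s)$ such that $S_{r-1,s+1}(x) = 1$. This pair of indices delimits the string of adjacent ordered bonds containing $k$. It follows that

$$x_{i,j}\, \nu\Big( -\sum_{s=k}^{N} \sum_{r=1}^{k} h_{r,s}\, x_{r,s} \Big) = x_{i,j} \sum_{s=k}^{N} \sum_{r=1}^{k} \nu(-\Delta^{k}_{r,s})\, S_{r-1,s+1}(x). \tag{70}$$

Finally, observing that for $i \le k \le j$ and $r \le k \le s$

$$
x_{i,j}\, S_{r-1,s+1}(x) =
\begin{cases}
0 & \text{if } r > i \text{ or } s < j; \\
S_{r-1,s+1}(x) & \text{otherwise},
\end{cases} \tag{71}
$$

we reach the result

$$x_{i,j}\, \nu\Big( -\sum_{s=k}^{N} \sum_{r=1}^{k} h_{r,s}\, x_{r,s} \Big) = \sum_{s=j}^{N} \sum_{r=1}^{i} \nu(-\Delta^{k}_{r,s})\, S_{r-1,s+1}(x), \tag{72}$$

that makes it possible to rewrite $f$ in a simpler way.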
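Substituting Eq. (72) in Eq. (67) and using Eq. (62) we obtain

$$
\begin{aligned}
f_{i,j}(\xi) &= \sum_{k=i}^{j} \sum_{s=j}^{N} \sum_{r=1}^{i} \nu(\Delta^{k}_{r,s}) \sum_{x \in \mathcal{C}_\Lambda} S_{r-1,s+1}(\mu_k(x))\, P_\Lambda[p](x) \\
&\quad - \sum_{k=i}^{j} \sum_{s=j}^{N} \sum_{r=1}^{i} \nu(-\Delta^{k}_{r,s}) \sum_{x \in \mathcal{C}_\Lambda} S_{r-1,s+1}(x)\, P_\Lambda[p](x),
\end{aligned} \tag{73}
$$

that contains expectation values of two kinds of observables. The sum

$$\sum_{x \in \mathcal{C}_\Lambda} S_{r-1,s+1}(x)\, P_\Lambda[p](x) \tag{74}$$

is easily computable since it is the expectation value of a linear combination of $x_{i,j}$'s. With a slight abuse of notation we will still denote this sum with $S_{r-1,s+1}(\xi)$. One can verify that

$$
S_{i,j}(\xi) =
\begin{cases}
1 - \xi_{i,i} & \text{if } i = j; \\
\xi_{i,j} - \xi_{i,i} - \xi_{j,j} + 1 & \text{if } j = i + 1; \\
\xi_{i,j} - \xi_{i,j-1} - \xi_{i+1,j} + \xi_{i+1,j-1} & \text{if } 0 \le i < j - 1 \le N,
\end{cases} \tag{75}
$$

where we assume $\xi_{i,j} = 0$ if $i = 0$ or $j = N + 1$.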
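
The relation between the stretch variables and the delimiter variables $S_{i,j}$ can be verified configuration by configuration. The sketch below assumes the definition $S_{i,j}(x) = (1 - m_i)\, x_{i+1,j-1}\, (1 - m_j)$ for $i < j$ (with an empty product equal to 1) and boundary bonds $m_0 = m_{N+1} = 0$; it checks that each $x_{i,j}$ is the sum of $S_{r-1,s+1}$ over $r \le i$ and $s \ge j$, and that $S_{i,j}$ is the linear combination $x_{i,j} - x_{i,j-1} - x_{i+1,j} + x_{i+1,j-1}$. The helper names are hypothetical.

```python
from itertools import product


def padded(m):
    """Add the boundary bonds m_0 = m_{N+1} = 0 to a configuration m."""
    return (0,) + tuple(m) + (0,)


def x_var(mp, i, j):
    """x_ij = m_i * ... * m_j on the padded chain; empty product (j < i) is 1."""
    return int(all(mp[i:j + 1]))


def s_var(mp, i, j):
    """S_ij: 1 if bonds i and j are disordered and all bonds between are ordered."""
    if i == j:
        return 1 - mp[i]
    return (1 - mp[i]) * x_var(mp, i + 1, j - 1) * (1 - mp[j])


def check_identities(n):
    """Return True if both identities hold for every configuration of n bonds."""
    for m in product((0, 1), repeat=n):
        mp = padded(m)
        # x_ij as a sum over the delimiters of the ordered stretch containing i..j.
        for i in range(1, n + 1):
            for j in range(i, n + 1):
                total = sum(s_var(mp, r - 1, s + 1)
                            for s in range(j, n + 1) for r in range(1, i + 1))
                if total != x_var(mp, i, j):
                    return False
        # S_ij as a linear combination of stretch variables (i < j).
        for i in range(0, n + 1):
            for j in range(i + 1, n + 2):
                linear = (x_var(mp, i, j) - x_var(mp, i, j - 1)
                          - x_var(mp, i + 1, j) + x_var(mp, i + 1, j - 1))
                if linear != s_var(mp, i, j):
                    return False
    return True
```

Because the second identity is linear in the stretch variables, taking expectation values on both sides gives the corresponding relation between $S_{i,j}(\xi)$ and the $\xi_{i,j}$'s.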
The computation of

$$\sum_{x \in \mathcal{C}_\Lambda} S_{r-1,s+1}(\mu_k(x))\, P_\Lambda[p](x) \tag{76}$$

is slightly more involved. Notice first that for $r \le k \le s$

$$S_{r-1,s+1}(\mu_k(x)) = S_{r-1,k}(x)\, S_{k,s+1}(x), \tag{77}$$

which vanishes if $x_{k,k} = m_k = 1$. Let us then define the clusters

Ai = {{t,j)eA:j<k}, (78) 
= {{z,j)eA:j>k}, (79) 
7^fc = {it,j)eA:j = k}, (80) 

depicted in Fig. [T], and introduce their probability distributions 



E ^a[pK^), (82) 
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FIG. 1: Graphical representation of the clusters A^, and TZk 



Exploiting the constraints in the same way as in [9| it is not difficult to obtain the relations 



Pa \p\ [x 



and 



P7^fc(0, ...,0) =p(fc^fc)(0) = l-^k,k- 
The quantity in (1761) can then be rewritten as 

Sr-l,s+l{fJ'k{x))PA[p]{^. 



(84) 



(85) 



[X 



xGCa 

Sr-l,k{x)Sk,s+lix)PA[p]ix) 



xgCa 



y^ Sr-l,k{x)Sk,s+l^ 



X] 



p^d{xA{)Phl{xAl, 



rrr- ^r~lA^)Sk,s+l{x)pAd{xAci)pA-^{xA{, 



(86) 



since the only configurations which contribute to the sum are those with Xk^k = 0. In 
addition, due to the vanishing of Xk^k, the above sum can be factored as follows: 



^ Sr-l,s+lMx))PA[p]ix) 



x€Ca 



^ \ Yl Sr-l,k{xAl)PAl{xKl) Y ^k,s+l{xAl)P\l{xAl] 
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' x^Ca xECa 
^ '5'r-l,A:(0'5'fc,^+l(0 ^g^^ 

Plugging Eq. fl87|) into Eq. fl73|) we find for / the final form 



k=i s=j r=l 



/^From the above results one sees that the computational complexity of the r.h.s. of 
Eq. (JT5I) is polynomial in the number of variables, and precisely of order A^^. Moreover it 
must be mentioned that writing directly an equation for the variables Sij the computational 
complexity lowers to the order A^^, which is useful in practical computations. 

V. EQUIVALENCE BETWEEN LOCAL EQUILIBRIUM APPROACH AND 
PATH PROBABILITY METHOD 

We conclude our paper by showing that for the WSME model there are no differences between the local equilibrium approach and the path probability method (PPM) [24, 25, 26], a variational approximate technique for the study of out-of-equilibrium systems that generalizes the CVM to the kinetics.

The PPM has a natural application to discrete time evolution, hence it is useful to turn to this one and then to recover the continuous case with a limiting process. We say that $T_\Lambda(x' \to x) \ge 0$ is a transition probability from the configuration $x'$ to $x$ if

$$\sum_{x \in \mathcal{C}_\Lambda} T_\Lambda(x' \to x) = 1. \tag{89}$$

If $W_\Lambda$ is the transition probability per unit time as in Eq. (9), let $T^\tau_\Lambda$ be a transition probability such that

$$T^\tau_\Lambda(x' \to x) = \delta_{x,x'} + \tau W_\Lambda(x' \to x) + \tau^2 E^\tau_\Lambda(x' \to x), \tag{90}$$

where $E^\tau_\Lambda$ is a bounded function of $\tau$ when $\tau \to 0$ and $\delta$ the Kronecker symbol. The discrete version of Eq. (9) then reads

$$p^{t+\tau}_\Lambda(x) = \sum_{x' \in \mathcal{C}_\Lambda} T^\tau_\Lambda(x' \to x)\, p^t_\Lambda(x') \tag{91}$$

for all $t \in \{n\tau\}_{n \in \mathbb{N}}$ and, as a consequence of Eq. (90), given $t \in \mathbb{R}^+$, in the limit $\tau \to 0$ and $n \to \infty$ such that $n\tau \to t$, the solution of this equation reduces to that of Eq. (9) with the same initial condition.
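
The passage from the discrete to the continuous time evolution can be illustrated on the simplest possible case. The sketch below iterates the discrete update with transition probability $\delta_{x,x'} + \tau W(x' \to x)$ (i.e. with a vanishing remainder term) for a hypothetical two-state system, and compares the result with the closed-form solution of the master equation; the rates `a` and `b` and the function names are assumptions of this example.

```python
from math import exp


def discrete_two_state(a, b, p1_start, t, n_steps):
    """Iterate p(t + tau) = (delta + tau * W) p(t) with tau = t / n_steps.

    a is the rate for 0 -> 1, b the rate for 1 -> 0; returns p_1 at time t.
    tau must satisfy tau * (a + b) <= 1 for the update to be a probability.
    """
    tau = t / n_steps
    p1 = p1_start
    for _ in range(n_steps):
        p0 = 1.0 - p1
        p1 = p1 + tau * (a * p0 - b * p1)
    return p1


def continuous_two_state(a, b, p1_start, t):
    """Closed-form solution of the two-state master equation."""
    p1_eq = a / (a + b)
    return p1_eq + (p1_start - p1_eq) * exp(-(a + b) * t)


def discretization_error(n_steps, a=1.0, b=2.0, p1_start=0.0, t=1.0):
    return abs(discrete_two_state(a, b, p1_start, t, n_steps)
               - continuous_two_state(a, b, p1_start, t))
```

Refining the time step at fixed total time makes the discrete solution approach the continuous one, with an error that shrinks roughly in proportion to the step.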
In PPM 



24 
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261] one assumes that has the form 

Tlix' ^x) = eX(x') n K« - ^a)r , (92) 

where Q\ ensures normalization ( !89l) and ^ and the same of Sec. [Tll 

Let V\ be the set of functions q = {qa}aeA such that : Cq, x Cq, —> M is not negative 
and V^, Qaix',Xa) = Qaix'fj.Xg) for /5 C a. For g G Pa define then the function 

QaI?] : Ca X Ca ^ M+ as 

QA[q]{x',x) = ll[qa{x'^,x^)Y'^. (93) 
It can be checked that qa is the marginal of QaIq] relative to the cluster a, i.e. 



) = ^ ^ QA[q]{x\ x). (94) 

The quantity T1{x' — > x)pa(3;')) where Pa(^') is the solution of Eq. fl9T]) . has a physical 
meaning: it is the joint probability of finding the system in the state x' at time t and in the 
state X at time t + r. The idea of the PPM is to replace this probability with an approximate 
one, which factors as in Eq. (193!) . i.e. to replace T^(x' — » x)p\{x') with Q\[q^''^]{x',x), and 
then to compute the marginal distribution at the time t + r as 

g^'"^ is then an approximation of the above joint distribution relative to the cluster a. The 
function g*'"^ must contain information about the state at time t and the kinetic prescription. 
As a consequence of the relation 

x6Ca 



■^A\a 



we impose to g '"^ the constraints 



E E <5A[g"1(x',a;) = ^g^'-(x:„x„) =pl(a;'J, (97) 
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that it has to be satisfied for all a ^ A. 

The kinetic generalization of the Kikuchi free energy Fa is 

^aIq] = ^aa^^[\nqa{x'^,Xa) -lnTJ(x^ ^ Xa)]qaix'^,Xa), (98) 

and g*'"^ is chosen as the distribution q G T>\ which minimizes K]^ with the constraints (1971) . 
If the factorization assumption were exact, this choice would give the exact marginals of the 
joint probability T]^{x' x)p\{x'). 

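It can be checked that the conditions $\lim_{\tau \to 0} q^{t,\tau}_\alpha(x'_\alpha, x_\alpha) = \delta_{x_\alpha, x'_\alpha}\, p^t_\alpha(x_\alpha)$ hold for all $\alpha \in A$. These allow us to summarize PPM, for continuous time evolution, as follows:

\[ \begin{cases} \dfrac{d}{dt}\, p^t_\alpha(x_\alpha) = \dfrac{d}{d\tau} \left[ \displaystyle\sum_{x'_\alpha} q^{t,\tau}_\alpha(x'_\alpha, x_\alpha) \right]_{\tau = 0}, \\[2ex] q^{t,\tau} = \mathrm{minarg} \left\{ K^\tau_\Lambda[q] : q \in \mathcal{D}_\Lambda[p^t] \right\}, \end{cases} \tag{99} \]

where, for $p \in \mathcal{P}_\Lambda$, $\mathcal{D}_\Lambda[p]$ denotes the set of $q \in \mathcal{D}_\Lambda$ such that $\sum_{x_\alpha} q_\alpha(x'_\alpha, x_\alpha) = p_\alpha(x'_\alpha)$.
In the following we shall prove the equivalence between the local equilibrium approach and PPM by showing that the problem (99) leads to Eq. (12). For this purpose we need the following proposition.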

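Proposition: Let $q^\tau$ be the minimum of the function $K^\tau_\Lambda$ in $\mathcal{D}_\Lambda[p]$. Then a function $R^\tau_\Lambda : C_\Lambda \to \mathbb{R}_+$ exists such that

\[ Q_\Lambda[q^\tau](x', x) = R^\tau_\Lambda(x')\, T^\tau_\Lambda(x' \to x)\, P_\Lambda[p](x'). \tag{100} \]

Moreover $R^\tau_\Lambda$ can be written as

\[ R^\tau_\Lambda(x) = 1 + \tau\, r^\tau_\Lambda(x), \tag{101} \]

where $r^\tau_\Lambda : C_\Lambda \to \mathbb{R}$ is a bounded function of $\tau$, when $\tau$ approaches zero, with the property

\[ \sum_{x_{\Lambda\setminus\alpha}} r^\tau_\Lambda(x)\, P_\Lambda[p](x) = 0 \tag{102} \]

for all $\alpha \in A$.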

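Before proving the proposition, we show how it leads to the equivalence of the two methods. From now on we denote by $q^\tau$ the minimum of $K^\tau_\Lambda$ in $\mathcal{D}_\Lambda[p]$. We want to show that

\[ \frac{d}{d\tau} \left[ \sum_{x'_\alpha} q^\tau_\alpha(x'_\alpha, x_\alpha) \right]_{\tau = 0} = \sum_{x' \in C_\Lambda} W_\alpha(x' \to x_\alpha)\, P_\Lambda[p](x'), \tag{103} \]

from which it follows that the PPM problem (99) coincides with the local equilibrium approach, Eq. (12). Eqs. (100), (101) and (90) allow us to obtain

\[ \begin{aligned} Q_\Lambda[q^\tau](x', x) &= R^\tau_\Lambda(x')\, T^\tau_\Lambda(x' \to x)\, P_\Lambda[p](x') \\ &= \left[ \delta_{x,x'} + \tau\, \delta_{x,x'}\, r^\tau_\Lambda(x') + \tau\, W_\Lambda(x' \to x) + \tau^2 E^\tau_\Lambda(x' \to x) \right] P_\Lambda[p](x'), \end{aligned} \tag{104} \]

where, as before, $E^\tau_\Lambda$ is a bounded function of $\tau$ when $\tau$ approaches zero. Then, summing and using Eq. (102), we have

\[ \begin{aligned} \sum_{x'_\alpha} q^\tau_\alpha(x'_\alpha, x_\alpha) &= \sum_{x_{\Lambda\setminus\alpha}} \sum_{x' \in C_\Lambda} Q_\Lambda[q^\tau](x', x) \\ &= p_\alpha(x_\alpha) + \tau \sum_{x_{\Lambda\setminus\alpha}} \sum_{x' \in C_\Lambda} W_\Lambda(x' \to x)\, P_\Lambda[p](x') \\ &\quad + \tau^2 \sum_{x_{\Lambda\setminus\alpha}} \sum_{x' \in C_\Lambda} E^\tau_\Lambda(x' \to x)\, P_\Lambda[p](x'), \end{aligned} \tag{105} \]

that, remembering the definition (13) of $W_\alpha(x' \to x_\alpha)$, gives the expected result (103).

Let us move now to the proof of the proposition.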
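Before turning to the proof, the limit in Eq. (103) can be probed numerically in the simplest setting of a single cluster, where the minimizer is the exact joint probability $p(x')\, T^\tau(x' \to x)$ and $R^\tau_\Lambda = 1$. The sketch below is a toy check under assumptions that are not in the text: the rate matrix `W` is random, and $T^\tau$ is taken to be $\exp(\tau W)$, which is one choice with the expansion $\delta_{x,x'} + \tau W + O(\tau^2)$. The helper `transition` is hypothetical.

```python
import numpy as np

rng = np.random.default_rng(2)
n = 4

# Hypothetical rate matrix: W[x', x] is the rate of the jump x' -> x for
# x != x'; the diagonal makes every row sum to zero.
W = rng.random((n, n))
np.fill_diagonal(W, 0.0)
np.fill_diagonal(W, -W.sum(axis=1))

p = rng.random(n)
p /= p.sum()

def transition(tau, order=20):
    """T^tau = exp(tau W), summed as a truncated power series."""
    T = np.eye(n)
    term = np.eye(n)
    for k in range(1, order + 1):
        term = term @ (tau * W) / k
        T += term
    return T

tau = 1e-6
q_tau = p[:, None] * transition(tau)   # q^tau(x', x) = p(x') T^tau(x' -> x)
marginal = q_tau.sum(axis=0)           # sum over x'

lhs = (marginal - p) / tau             # finite-difference d/dtau at tau = 0
rhs = p @ W                            # sum_{x'} W(x' -> x) p(x')
print(np.max(np.abs(lhs - rhs)))
```

The printed discrepancy is of order `tau`, as expected from the $\tau^2$ remainder in Eq. (105).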

Proof. Let us denote by $A_M$ the family of maximal clusters, i.e. those that are not enclosed in any other cluster, and by $A_m$ the family of those which are intersections of at least two maximal clusters. Notice that $A_M \cap A_m = \emptyset$ and $A_M \cup A_m = A$. In the case of the WSME model $A_M$ is the set of all square plaquettes and the triangles lying on the diagonal boundary, while $A_m$ is the set of internal nearest-neighbour pairs and single nodes.
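The splitting of the cluster family into maximal clusters and their intersections can be made concrete on a small example. The sketch below is an illustration only: it assumes a toy family built from the four square plaquettes of a 3x3 grid of nodes (no boundary triangles), and it computes the coefficients with the standard cluster variation rule that the coefficients of all clusters containing a given cluster, itself included, sum to one. That rule is assumed to coincide with the definition of $a_\alpha$ used in the text; the names `plaquette`, `maximal`, `minor` and `a` are hypothetical.

```python
from itertools import combinations

# Toy cluster family (an assumption, smaller than the WSME one): the four
# square plaquettes of a 3x3 grid of nodes, each node labelled (row, col).
def plaquette(r, c):
    return frozenset({(r, c), (r, c + 1), (r + 1, c), (r + 1, c + 1)})

maximal = {plaquette(r, c) for r in range(2) for c in range(2)}   # A_M

# A_m: non-empty intersections of at least two maximal clusters.
minor = set()
for k in range(2, len(maximal) + 1):
    for group in combinations(maximal, k):
        common = frozenset.intersection(*group)
        if common:
            minor.add(common)

clusters = maximal | minor   # A

# Moebius numbers from the standard CVM rule: for every cluster, the
# coefficients of the clusters containing it (itself included) sum to one.
a = {}
for alpha in sorted(clusters, key=len, reverse=True):
    a[alpha] = 1 - sum(a[beta] for beta in a if alpha < beta)

# Group the coefficients by cluster size (4 = plaquette, 2 = pair, 1 = node).
by_size = {}
for alpha, coeff in a.items():
    by_size.setdefault(len(alpha), set()).add(coeff)
print(by_size)
```

In this toy family the intersections are the four internal nearest-neighbour pairs and the central node, mirroring the description of $A_m$ given above.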

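In addition, given $\beta \in A_m$, let us introduce the set

\[ A^\beta_M = \{ \alpha \in A_M : \beta \subset \alpha \}. \tag{106} \]

In the same way, for $\alpha \in A_M$,

\[ A^\alpha_m = \{ \beta \in A_m : \beta \subset \alpha \}. \tag{107} \]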

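The minimization of $K^\tau_\Lambda$ in $\mathcal{D}_\Lambda[p]$ is addressed by the method of Lagrange multipliers. We consider the function

\[ \begin{aligned} K^\tau_\Lambda[q; \lambda] = K^\tau_\Lambda[q] &- \sum_{\alpha \in A} \sum_{x'_\alpha} \lambda_\alpha(x'_\alpha) \left[ \sum_{x_\alpha} q_\alpha(x'_\alpha, x_\alpha) - p_\alpha(x'_\alpha) \right] \\ &+ \sum_{\alpha \in A_M} \sum_{\beta \in A^\alpha_m} \sum_{x'_\beta,\, x_\beta} \lambda_{\alpha\beta}(x'_\beta, x_\beta) \left[ \sum_{x'_{\alpha\setminus\beta},\, x_{\alpha\setminus\beta}} q_\alpha(x'_\alpha, x_\alpha) - q_\beta(x'_\beta, x_\beta) \right]. \end{aligned} \tag{108} \]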



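Taking derivatives with respect to the variables $q_\alpha(x'_\alpha, x_\alpha)$ we obtain the equations

\[ a_\alpha \ln q^\tau_\alpha(x'_\alpha, x_\alpha) - a_\alpha \ln T^\tau_\alpha(x'_\alpha \to x_\alpha) + \sum_{\beta \in A^\alpha_m} \lambda_{\alpha\beta}(x'_\beta, x_\beta) - \lambda_\alpha(x'_\alpha) + a_\alpha = 0 \tag{109} \]

and

\[ a_\beta \ln q^\tau_\beta(x'_\beta, x_\beta) - a_\beta \ln T^\tau_\beta(x'_\beta \to x_\beta) - \sum_{\alpha \in A^\beta_M} \lambda_{\alpha\beta}(x'_\beta, x_\beta) - \lambda_\beta(x'_\beta) + a_\beta = 0, \tag{110} \]

that are satisfied by $q^\tau$ for $\alpha \in A_M$ and $\beta \in A_m$. Summing now the first over $A_M$, the second over $A_m$ and the results together, we arrive at Eq. (100):

\[ Q_\Lambda[q^\tau](x', x) = R^\tau_\Lambda(x')\, T^\tau_\Lambda(x' \to x)\, P_\Lambda[p](x') \tag{111} \]

with a suitable function $R^\tau_\Lambda$.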
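The summation step can be spelled out as follows. This is a sketch under two assumptions that are not stated in the text: $P_\Lambda[p](x') > 0$, and the symbol $\Theta^\tau_\Lambda(x')$ is used here for the normalization factor of Eq. (92). The explicit expression for $R^\tau_\Lambda$ is only one way of collecting the terms that depend on $x'$ alone.

```latex
% Each multiplier \lambda_{\alpha\beta} enters Eq. (109) with a plus sign and
% Eq. (110) with a minus sign, once for every pair \beta \subset \alpha with
% \alpha \in A_M and \beta \in A_m, so it cancels in the total sum.
\begin{align*}
0 &= \sum_{\alpha \in A} a_\alpha \ln q^\tau_\alpha(x'_\alpha, x_\alpha)
   - \sum_{\alpha \in A} a_\alpha \ln T^\tau_\alpha(x'_\alpha \to x_\alpha)
   - \sum_{\alpha \in A} \lambda_\alpha(x'_\alpha)
   + \sum_{\alpha \in A} a_\alpha , \\
% Use Eq. (93) for the first sum and Eq. (92) for the second one.
\ln Q_\Lambda[q^\tau](x', x) &= \ln \frac{T^\tau_\Lambda(x' \to x)}{\Theta^\tau_\Lambda(x')}
   + \sum_{\alpha \in A} \left[ \lambda_\alpha(x'_\alpha) - a_\alpha \right] , \\
% Everything except T depends on x' only and can be absorbed into R.
R^\tau_\Lambda(x') &= \frac{\exp \left\{ \sum_{\alpha \in A} \left[ \lambda_\alpha(x'_\alpha) - a_\alpha \right] \right\}}
   {\Theta^\tau_\Lambda(x')\, P_\Lambda[p](x')} .
\end{align*}
```

The point of the manipulation is that the only dependence on the final state $x$ is carried by $T^\tau_\Lambda(x' \to x)$, which is what Eq. (100) asserts.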

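The relations $\lim_{\tau \to 0} q^\tau_\alpha(x'_\alpha, x_\alpha) = \delta_{x_\alpha, x'_\alpha}\, p_\alpha(x_\alpha)$ and $\lim_{\tau \to 0} T^\tau_\Lambda(x' \to x) = \delta_{x,x'}$ imply then

\[ \lim_{\tau \to 0} R^\tau_\Lambda(x) = 1 \tag{112} \]

and from Eq. (111) it follows that a bounded function $r^\tau_\Lambda : C_\Lambda \to \mathbb{R}$ exists such that Eq. (101) holds.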

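We conclude the proof by showing the validity of Eq. (102). Using Eq. (55) we find

\[ \sum_{x \in C_\Lambda} Q_\Lambda[q^\tau](x', x) = R^\tau_\Lambda(x')\, P_\Lambda[p](x') \tag{113} \]

and using Eq. (101), together with Eq. (94) and the condition $q^\tau \in \mathcal{D}_\Lambda[p]$, we obtain

\[ \begin{aligned} p_\alpha(x'_\alpha) = \sum_{x'_{\Lambda\setminus\alpha}} R^\tau_\Lambda(x')\, P_\Lambda[p](x') &= \sum_{x'_{\Lambda\setminus\alpha}} P_\Lambda[p](x') + \tau \sum_{x'_{\Lambda\setminus\alpha}} r^\tau_\Lambda(x')\, P_\Lambda[p](x') \\ &= p_\alpha(x'_\alpha) + \tau \sum_{x'_{\Lambda\setminus\alpha}} r^\tau_\Lambda(x')\, P_\Lambda[p](x'), \end{aligned} \tag{114} \]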
from which Eq. (102) follows.

VI. CONCLUSIONS

We have given proofs of a few rigorous results announced in [1], concerning the
properties of the local equilibrium approach to the WSME model. In particular, we have 
proven that (i) the free energy decreases with time, (ii) the exact equilibrium is recovered 
in the infinite time limit, (iii) the equilibration rate is an upper bound of the exact one, 
(iv) computational complexity is polynomial in the number of variables, and (v) the local 
equilibrium approach is equivalent to the path probability method. We have also reported 
the detailed form of the kinetic equations. 

It is important here to stress that our proofs depend only on the assumption of a fac- 
torization property for the probability distribution of the model, and hence can be easily 
carried over to other models with the same property. 

Our results also shed new light on the physical meaning of the path probability method.